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Abstract 

This paper recalls the development of gauge theory culminating in Yang-Mills theory, 
and the application of differential geometry including connections on fiber bundles to 
field theory. Finally, we see how the preceding is used to explain the Feynman diagrams 
appearing on the Feynman postage stamp released in May 2005. Version 2 included the 
Feynman diagrams. Version 3 corrected typos and Version 4 included an appendix for 
the derivation of the Yang-Mills transformation and field strength. Version 5 indicates 
that the article has been published in the Notices of the AMS and in July 2009 appears 
in Chinese translation in a journal of the Chinese Academy of Sciences. It also corrects 
some typos and adds to the appendix a hueristic derivation of the Yang-Mills field 
strength. 

On May 11, the late Richard Feynman's birthday, a stamp was dedicated to Feynman at the Post 
Office in Far Rockaway, New York City, Feynman's boyhood home. (At the same time, the United 
States Postal Service issued three other stamps honoring the scientists Josiah Willard Gibbs and 
Barbara McClintock, and the mathematician John von Neumann.) 

The design of the stamp tells a wonderful story. The Feynman diagrams on it show how Feynman's 
work originally applicable to QED, for which he won the Nobel prize, was then later used to eluci- 
date the electroweak force. The design is meaningful to both mathematicians and physicists. For 
mathematicians, it demonstrates the application of differential geometry; for physicists, it depicts 
the verification of QED, the application of the Yang-Mills equations and the establishment and 
experimental verification of the electroweak force, the first step in the creation of the standard 
model. The physicists used gauge theory to achieve this and were for the most part unaware of 
the developments in differential geometry. Similarly mathematicians developed fiber-bundle theory 
without knowing that it could be applied to physics. We should, however, remember that in general 
relativity, Einstein introduced geometry into physics. And as we will relate below, Weyl did so for 
electromagnetism. General relativity sparked mathematicians interest in parallel transport, eventu- 
ally leading to the development of fiber-bundles in differential geometry. After physicists achieved 
success using gauge theory, mathematicians applied it to differential geometry. The story begins 
with Maxwell's equations. In this story the vector potential A goes from being a mathematical 
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construct used to facilitate problem solution in electromagnetism to taking center stage by causing 
the shift in the interference pattern in the Aharonov-Bohm solenoid effect. As the generalized 
four- vector A^, it becomes the gauge field that mediates the electromagnetic interaction, and the 
electroweak and strong interactions in the standard model of physics - is understood as the 
connection on fiber-bundles in differential geometry. The modern reader would be unaccustomed to 
the form in which Maxwell equations first appeared. They are easily recognizable when expressed 
using vector analysis in the Heaviside-Gibbs formulation. 

Maxwell's Equations 

The equations used to establish Maxwell's equations in vacuo expressed in Heaviside-Lorentz ra- 
tionalized units are: 

(1) V • E = p (Gauss's law) 

(2) V • B = (No magnetic monpoles) 

(3) V X B = J (Ampere's law) 

(4) V X E = -dB/dt ((Faraday's and Lenz's law) 

where E and B are respectively the electric and magnetic fields; p and J arc the charge density 
and electric current. The continuity equation which dictates the conservation of charge: 

(5) V • J + dp/dt = 

indicates that Maxwell's equations describe a local theory since you cannot destroy a charge locally 
and recreate it at a distant point instantaneously. The concept that the theory should be local 
is the corner-stone of the gauge theory used in quantum field theory, resulting in the Yang-Mills 
theory, the basis of the standard model. 

Maxwell realized that since: 

(6) V • V X B = 

equation (3) is inconsistent with (5), he altered (3) to read 
(3') V xB = 3 + d^/dt 

Thus a local conservation law mandated the addition of the d^/dt term. Although equations (1), 
(2), (3') and (4) are collectively known as Maxwell's equations. Maxwell himself was only respon- 
sible for (3'). 

Maxwell calculated the speed of a wave propagated by the final set of equations, and found its 
velocity very close to the speed of light. He thus hypothesized that light was an electromagnetic 
wave. Since the curl of a vector cannot be calculated in two-dimensions, Maxwell's equations indi- 
cate that light, as we know it, cannot exist in a two-dimensional world. This is the first clue that 
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electromagnetism, is bound up with geometry. In fact equation (6) is the vector analysis equivalent 
of the differential geometry result stating that if /3 is a p-form, and d/? is its exterior derivative, 
then d(d/3), or d^/J = 0. 

Unlike the laws of newtonian mechanics, Maxwell's equations carry over to relativistic frames. The 
non-homogeneous equations, (1) and (3'), become 

(7) ^^,F^"' = r 

while the homogeneous equations, (2) and (4) become 

(8) e^P^^dpF^s = 

where e^^'^^ is the Levi-Civita symbol, F^^^ = d^^A^ — 9^^'^, and is the scalar potential and >l*'s 
(i = 1, 2, 3) the components of the vector potential A. Note that both equations (7) and (8) are 
manifestly covariant. Yan^ remarked that equation (8) is related to the geometrical theorem that 
the boundary of a region has no boundary. In a later section, we will show that equation (8) is due 
to the principle (f'uj = 0, where u; is a p-form. Yang's geometrical explanation can be understood 
in differential geometry terms using the generalized Stoke's theorem: fj^jduj = jgj^oj, where to is 
an n-form and M is an n + 1 dimension oriented manifold with boundary dM. For the purposes 
of this article a manifold is simply a surface that is locally Euclidean. Because (Pu = 0, this leads 
to (i^oj = Jq^j duj = /g2jv/ ^ = where d'^M indicates the boundary of a boundary of a region. 
If we assume that uj is non- vanishing, then d'^M is 0. The Mobius strip can be used as another 
example of the theorem Yang cites. 

Gauge Invariance 

In a 1918 article, Hermann Weyi tried to combine electromagnetism and gravity by requiring the 
theory to be invariant under a local scale change of the metric g^^ —* g^ue°'^^\ where x is a 4- vector. 
This attempt was unsuccessful and was criticized by Einstein for being inconsistent with observed 
physical results. It predicted that a vector parallel transported from point p to q would have a 
length that was path dependent. Similarly, the time interval between ticks of a clock would also 
depend on the path on which the clock was transported. The article did, however, introduce 

• The term "gauge invariance", his term was Eichinvarianz. It refers to invariance under his 
scale change. The first use of "gauge invariance" in Englislill was in Weyl's 1929 English 
versioiH of his famous 1929 paper. 

• The geometric interpretation of electromagnetism, 

• The beginnings of non-abelian gauge theory. The similarity of Weyl's theory to non-abelian 
gauge theory is more striking in his 1929 paper. 

^Yang, C.N., (1980), Physics Today, 6 42 

^Weyl, Hermann, (1919), Sitzwingsher. Preuss. Akad., Berlin, 465 

■'See Jackson, J. D. and Okun, L. B., (2001). Rev. Mod. Physics, 73, 663. 

'^Weyl, H. (1929). Proc. Natl. Acad. Sci., 15, 32. 
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By 1929 Maxwell's equations had been combined with quantum mechanics to produce the start of 
quantum electrodynamics. Weyl in his 1929 article!] turned from trying to unify electromagnetism 
and gravity to following a suggestion originally thought to have been made by Fritz London in his 
1927 articlelll, and introduced as a phase factor an exponential in which the phase a is preceded by 
the imaginary unit i, e.g., e"'"*''"^-^^ in the wave function for the wave equations (for instance, the 
Dirac equation is {ij^dfj^ — m)tp = 0). It is here that Weyl correctly formulated gauge theory as a 
symmetry principle from which electromagnetism could be derived. It was to become the driving 
force in the development of quantum field theory. In their 2001 Rev. Mod. Phys. paper Jackson 
and Okun point out that in a 1926 papeiEl pre-dating London's, Fock showed that for a quantum 
theory of charged particles interacting with the electromagnetic field, invariance under a gauge 
transformation of the potentials required multiplication of the wave function by the now well-know 
phase factor. Many subsequent authors incorrecty cited the date of Fock's paper as 1927. Weyl's 
1929 article along with his 1918 one and Fock's and London's, and other key articles appear in 
translation in a work by O'Raifeartaigljfl with his comments. YanJ^ discusses Weyl's gauge theory 
results as reported by Paul|ll], as a source for Yang-Mills gauge theory (although Yang didn't find 
out until much later that these were Weyl's results): 

I was very much impressed with the idea that charge conservation was related to the 
invariance of the theory under phase changes, an idea, I later found out, due originally 
to H. Weyl. I was even more impressed with the fact the gauge-invariance determined 
all the electromagnetic interactions. 

For the wave equations to be gauge invariant, i.e., have the same form after the gauge transforma- 
tion as before, the local phase transformation ip{:x.) — > ■0(a;)e^*'^"^^^ has to be accompanied by the 
local gauge transformation 

(9) ^ - 9^a(x) 

(The phase and gauge transformations are local because a(x) is a function of x.) This dictates that 
the df^ in the wave equations be replaced by + iqAf^ in order for the d^a{x) terms to cancel each 
other. Thus gauge invariance determines the type of interaction - here, the inclusion of the vector 
potential. This is called the gauge principle and is called the gauge field or gauge potential. 
Gauge invariance is also called gauge symmetry. In electromagnetism, A is the space-time vector 
potential representing the photon field, while in electroweak theory, A represents the intermediate 
vector bosons and fields and in the strong interaction, A represents the colored gluon fields. 
The fact that the q in ij;(x)e~^^'^°'^^^ must be the same as the q in dfj^+iqAfj^ to insure gauge invariance, 
means that the charge q must be conserved^l Thus gauge invariance dictates charge conservation. 
By Noether's theorem, a conserved current is associated with a symmetry. Here the symmetry is 
the non-physical rotation invariance in an internal space called a fiber. In electromagnetism the 

®Weyl, Hermann, (1929). Zeit. f. Physic, 330 56. 
■'London, Fritz, (1927). Zeit. f. Physic, 42 375. 
*Fock, v., (1926). Z. Phys, 39 226. 

'O'Raifeartaigh L., (1997) The Dawning of Gauge Theory, Princeton University. 
^''Yang, C.N., (2005) Selected Papers (1945-1980) With Commentary, pl9, World Scientific. 
"Pauli, W., (1941). Rev. Mod. Physics, 13, 203. 

^^See section 4.6 of Aitchison, I.J.R., and Hey, A.J.G., (1989) Gauge Theories in Particle Physics, Adam Hilger. 
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rotations form the group U(l), the group of unitary 1-dimensional matrices. U(l) is an example of 
a structure group and the fiber is S^, the circle. 

A fiber bundle is determined by two manifolds and the structure group G which acts on the fiber: 
the first manifold, called the total space E consists of many copies of the fiber F, one for each point 
in the second manifold, the base manifold M which for our discussion is the space-time manifold. 
The fibers are said to project down to the base manifold. A principal fiber bundl^ is a fiber 
bundle in which the structure group, G, is a Lie group that acts on the total space E in such a way 
that each fiber is mapped onto itself and the action of an individual fiber looks like the action of 
the structure group on itself by left-translation. In particular, the fiber F is diffeomorphic to the 
structure group G. 

The gauge principle shows how electromagnetism can be introduced into quantum mechanics. The 
transformation ^ + iqAfj_ is also called the minimal principle and the operation + iqAfj_ is 
the covariant derivative of differential geometry, D = d -|- iqA, where A is the connection on a fiber 
bundle. A connection on a fiber bundle allows one to identify fibers over points hi eM via parallel 
transport along a path 7 from bi to 62- In general, the particular identification is path dependent. 
It turns out that the parallel transport depends only on the homotopy class of the path if and only 
if the curvature of the connection vanishes identically. Recall that two paths are homotopic if one 
can be deformed continuously onto the other keeping the end points fixed. 

Weyl in his 1929 paper also includes an expression for the curvature of the connection A, 
namely Cartan's second structural equation which in modern differential geometry notation is 
fi = dA -|- A A A. It is the same form as the equation used by Yang and Mills which in modern 
notation is = dA -|- [A, A], where [] is the Lie bracket. Since the transformations in (9) form an 
abelian group U(l), the space-time vector potential A commutes with itself. Thus in electromag- 
netism the curvature of the connection A is just 

(10) n = dA. 

which, as we will see in the next section, is the field strength F defined as F = dA. 
Differential Geometry 

Differential geometry principly developed by Levi-Civita, Cartan, Poincare, de Rham, Whitney, 
Hodge, Chern, Steenrod and Ehresmann led to the develpment of fiber-bundle theory which is used 
in explaining the geometric content of Maxwell's equations. It was later used to explain Yang-Mills 
theory and to develop string theory. The successes of gauge theory in physics sparked mathemati- 
cians interest in it. In the 1970's Sir Michael Atiyah initiated the study of the mathematics of the 
Yang-Mills equations and in 1983 his student Simon Donaldson using Yang-Mills theory discovered 
a unique property of smooth manifold£l| in M^. Michael Preedman went on to prove that there 
exists multiple exotic differential structures only on M^. It is known that in other dimensions, the 
standard differential structure on is unique. 

In 1959 Aharonov and BohnJ^ established the primacy of the vector potential by proposing an 
electron diffraction experiment to demonstrate a quantum mechanical effect: A long solenoid lies 

^''in giving these definitions, we restrict attention to the smooth manifolds which is adequate for our discussion. 
"Donaldson, S. K. (1983), Bull. Amer. Math. Soc. 8, 81. 
^^Y. Aharonov and D. Bohm, (1959) Phys. Rev. 115, 485. 
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behind a wall with two slits and is positioned between the slits and paralled to them. An electron 
source in front of the wall emits electrons that follow two paths. One path through the upper slit 
and the other path through the lower slit. The first electron path flows above the solenoid and the 
other path flows below it. The solenoid is small enough so that when no current flows through it, the 
solenoid doesn't interfere with the electrons' flow. The two paths converge and form a diffraction 
pattern on a screen behind the solenoid. When the current is turned on, there is no magnetic or 
electric field outside the solenoid so the electrons cannot be effected by these fields; however there is 
a vector potential A and it effects the interference pattern on the screen. Thus Einstein's objection 
to Weyl's 1918 paper can be understood as saying that there is no Aharonov— Bohm effect for 
gravity. Because of the necessary presence of the solenoid, the upper path cannot be continuously 
deformed into the lower one. Therefore, the two-paths are not homotopically equivalent. 

The solution of {l/2m){—ih\/ — qA./cf"ijj+qV'4j = Eip, the time-independent Schrodinger's equation 
for a charged particle, is '0o(x)e^"'/'^'^) ' ' A(y)(is (y) -^j^gj-e tpo{^) is the solution of the equation for 
A equals zero and s(x) represents each of the two paths. Here c is the speed of light and h is 
Plank's constant divided by 2tt. The interference term in the superposition of the solution for the 
upper path and that for the lower path produces a difference in the phase of the electron's wave 
function called a phase shift. Here the phase shift is (q/ch) ^ A(x.)dx. By Stoke's theorem, the 
phase shift is {q/ch)(j) where (j) is the magnetic flux in the solenoid, / B • dS. Mathematically, their 
proposal corresponds to the fact that even if the curvature [the electromagnetic fleld strength] of 
the connection vanishes [as it does outside of the solenoid] parallel transport along non-homotopic 
paths can still be path-dependent [producing a shift in the diffraction pattern]. 

Chamber^ performed an experiment to test the Aharonov and Bohm (AB) effect. The experiment, 
however, was criticized because of leakage from a tapered magnetic needle. TonomursPj et. al. 
performed beautiful experiments that indeed verified the AB prediction. Wu and Yang^ analyzed 
the prediction of Aharonov and Bohm and comment that different phase shifts {q/ch)(p may describe 
the same interference pattern, whereas the phase factor e^^'^^'^^'^'^ provides a unique description. The 
equation e^^^^* = 1, where N is an integer means that e(*9/c?i){0+27rAfcn/g) ^ ^{iq/ch),f>^27TNi ^ ^{iq/ch)4>^ 

Thus fiux of (f),(p + 2Trch/q,(l) + 4TTch/q... all describe the same interference pattern. Moreover, they 
introduced a dictionary relating gauge theory terminology to bundle terminology. For instance, the 
gauge theory phase factor corresponds to the bundle parallel transport; and as we shall see, the 
Yang-Mills gauge potential corresponds to a connection on a principal fiber bundle. 

Let's see how using the primacy of the four-vector potential A, we can derive the homogeneous 
Maxwell's equations from differential geometry simply by using the gauge transformation. Then 
we'll get the non-homogeneous Maxwell's equations for source-free (J = 0) electromagnetism using 
the fact that our world is a four-dimensional (space-time) world. 

We will also show that Maxwell's equations are invariant under the transformations A^ A^ -|- 
d^a{x), or expressed in differential geometry terms, A — > A + da(x). We want a(x) to vanish 
when a function of A is assigned to the E and B fields. Taking the exterior derivative of A will do 
this since d^a(x) = 0. Set A to the 1-form A = -Aodt + Axdx + Aydy + Azdz. Evaluating dA 
and realizing that the wedge product dx* A dx^ = —dx^ A dx* and therefore dx^ A dx^ = where 
dx^ is dt, dx^ is dx, dx^ is dy and dx^ is dz, produces the 2-form dA consisting of terms like 

^'^Chambers, R. G., (1960) Phys. Rev. Lett. 5, 3 

"Tonomura, Akira, et. al., (1982) Phys. Rev. Lett. 48, 1443, and (1986) Phys. Rev. Lett. 56, 792 
"Wu, T. T. and Yang, C. N., (1975) Phys. Rev. D, 12, 3845 
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{dxAo + dtAx)dtdx and {dxAy — dyAx)dxdy. When ah the components are evaluated, these terms 
become respectively VAq + dA/dt and V x A. The analysis up to now has been purely mathe- 
matical. To give it physical significance we associate these terms with the field strengths B and 
E. In electromagnetic theory, two fundamental principles are V • B = (no magnetic monopoles) 
and for time- independent fields E = —VAq (the electromagnetic field is the gradient of the scalar 
potential), so consistency dictates that in the time-dependent case, we assign the two terms to B 
and E respectively: 

(11) B = V X A and E = -VAq - dtA 

The gradient, curl and divergence are spatial operators - they envolve the differentials dx, dy and 
dz. The exterior derivative of a scalar is the gradient, the exterior derivative of a spatial 1-form 
is the curl, and the exterior derivative of a spatial two-form is the divergence. In the 1-form A, 
the -AQdt is a spatial scalar and when the exterior derivative is applied gives rise to V^o- The 
remaining terms in A are the coefficients of dx^ constituting a spatial 1-form and thus produce 
V X A. 

We define the field strength, F as F = dA and from equation (10), we see that the field strength 
is the curvature of the connection A. Using the equations in (11) and the 2-form dA we get 

(12) F = Exdxdt + Eydydt + E^dzdt + Bxdydz + Bydzdx + S^dxdy 
where for example dxdt is the wedge product dx A dt. Since d^A = 

(13) dF = 

Evaluating dF gives the homogeneous Maxwell's equations. In equation (12) since the E part is 
a spatial 1-form, when the exterior derivative is applied, it produces the V x E part of Maxwell's 
homogenous equations. Since the B part of equation (12) is a spatial 2-form, it results in the V • B 
part. Since dF = 0, F is said to be a closed 2-form. 

To get the expression for the non-homogeneous Maxwell's equations, i.e., the equivalent of equation 
(7), we use 

(14) J = pdt + Jxdx + Jydy + J^dz 

and calculate the Hodge dual using the Hodge star operator. The Hodge Duals are defineci^ 
by *Fap = l/2eai3-ysF'^^ and *Jaf3-y = ^af3-ysJ^- The Hodge sta@ operates on the differentials in 
equation (12) and (14) using *{dx^dt) = dx^dx^ and *{dx^dx^) = —dx^dt where i, j, and k refer to 
X, y and z, and are taken in cyclic order. The metric used is (-+++). Thus the Hodge star takes 
a spatial 1-form dx^dt into a spatial 2-form and vice versa with a sign change. 

The non-homogeneous Maxwell's equations are then expressed by 

^^Misner, C. W., Thorne, K. S., and Wheeler, J. A., (1973) Gravitation. Freeman, San Francisco. 
^''Flanders, H., (1963). Differential Forms. Academic Press. 
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(15) d*F = (source-free) 



(15') d*F = *J (non-source-free) 

where the 2-form *F and the 3- form *J are respectively the Hodge duals of F and J. *F and *J 
are defined as 

(16) *F = -B^dxdt - Bydydt - B^dzdt + E^dydz + Eydzdx + E^dxdy 

(17) *J = pdxdydz — Jxdtdydz — Jydtdzdx — Jzdtdxdy 

Thus the Hodge star reverses the rolls of E and B from what they were in F. In *F the coefficient 
of the spatial 1-form is now — B which will produce the curl in the non-homogeneous Maxwell's 
equations, and the coefficient of the spatial 2-form is E which will produce the divergence. In 
M", the Hodge star operation on a p-form produces an (n-p)-form. Thus the form of Maxwell's 
equations is dictated by the fact that we live in a four-dimensional world. When the 1-form A 
undergoes the local gauge transformation A — > A -|- da(x), dA remains the same since d^a = 0. 
Since B and E are unchanged, Maxwell's theory is gauge invariant. 

The Dirac and Electromagnetism Lagrangians 

To prepare for the discussion of the Yang-Mills equations, let's investigate the Dirac and Electro- 
magnetism Lagrangians The Dirac equation is 

(20) (n'^S^ - m)V' = 

where the speed of light, c, and Plank's constant h are set to one. Its Lagrangian density is 

(21) C = ipii'yf'di^ - m)V' 

The Euler-Lagrange equations minimize the action S where S = J £dx. Using the Euler-Lagrange 
equation where the differentiation is with respect to jp, i.e., 

(22) Ma^)) - i = 
yields equation (20). 

The same gauge invariant argument used in the Gauge Invar iance section applies here. In order for 
the Lagrangian to be invariant under the phase transformation V'(x) — ^ ^(x)e'''*"'-^\ this transfor- 
mation has to be accompanied by the local gauge transformation A^ — >■ A^ — e~^5^a(x) and d/^ 
has to be replaced by + ieA^. The Lagrangian density becomes 
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(23) L = ^{i-i^d^, - m)^ - eil)-i^'4)A^ 



The last term is the equivalent of the interaction energy with the electromagnetic field, j^A^. In 
order for the Euler-Lagrangian equation differentiated with respect to to yield the inhomoge- 
neous Maxwell equation (7) we must add — getting 

(24) C = iPini'd^ - m)^ - ei^^i^A^ - {\){F^.f 
The Euler-Lagrange equation yields 

(25) a^F'^'^ = ei^Yi^ 

which equals J'^. Note that the gauge field A^ does not carry a charge and there is no gauge field 
self-coupling which would be indicated by an [^^,^4,^] term in (25). The Lagrangian density does 
not yield the homogeneous Maxwell equations. They are satisfied trivially because the definition 
of F^*^ satisfies the homogeneous equations automatically!^ 

From this it is apparent that the Lagrangian density for the electromagnetic field alone 

(26) £ = -J^^^-(i)(F^,)2 
yields all of Maxwell's equations. 

In differential geometry, if j = 0, this Lagrangian density becomes 

(27) C = {^FA*F) 
The Yang-Mills Theory 

The Yang-Mills theory incorporates isotopic spin symmetry introduced in 1932 by Heisenberg who 
observed that the proton and neutron masses are almost the same (938.272 MeV versus 939.566 MeV 
respectively). He hypothesized that if the electromagnetic field was turned off, the masses would be 
equal and the proton and neutron would react identically to the strong force, the force that binds 
the nucleus together and is responsible for the formation of new particles and the rapid (typically 
their lifetimes are aboutlO"^'' seconds) decay of others. In a non-physical space (also known as an 
internal space) called isospin space, the proton would have isospin up, for instance, and the neutron, 
isospin down; but other than that, they would be identical. The wave function for each particle 
could be transformed to that for the other by a rotation using the spin matrices of the non-abelian 
group SU(2). Because of charge independence, the strong interactions are invariant under rotations 
in isospin space. Since the ratio of the electromagnetic to strong force is approximately a, where 
a = /A-Khc = 1/137, to a good approximation we can neglect the fact that the electromagnetic 
forces break this symmetry. By Noether's theorem, if there is a rotational symmetry in isospin 

■^^ Jackson, J. D., (1998). Classical Electrodynamics, 3rd Ed, p600. John Wiley and Sons. 
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space, the total isotopic spin is conserved. This hypothesis enables us to estimate relative rates 
of the strong interactions in which the final state has a given isospin. The spin matrices turn out 
to be the Pauli matrices cjj. The theory just described is a global one, i.e., the isotopic spin is 
independent of the space-time coordinate and thus no connection is used. We will see that Yang 
and Millj^ elevated this global theory to a local one. In 1954 they proposed applying the isospin 
matrices to electromagnetic theory in order to describe the strong interactions. Ultimately their 
theory was used to describe the interaction of quarks in the electroweak theorj@ and the gluons 
fields of the strong force. In the next section we will give an example using the up quark u which 
has a charge of |e and down quark d which has a charge of -^e. 

We have seen that the field strength (which is also the curvature of the connection on the fiber) is 
given by F = dA + A A A. In electromagnetism A is a 1-form with scalar coefficients for dx* so 
A A A vanishes. If, however, the coefficients are non-commuting matrices AAA does not vanish 
and provides for gauge field self-coupling. Yang and Mills formulated the field strength, using the 
letter B instead if A, so we will follow suit. The field is 

(28) F^, = {d,B^ - d^B,) + ie{B^B, - B,B^) 

or equivalently F^y = {dfj,Bu — dyB^) + ie[Bfj,, B^], where B is the connection on a principal fiber 
bundle, i.e., the gauge potential. So as opposed to the electromagnetic field strength which is lin- 
ear, their field strength is non-linear. They proposed using a local phase. For instance, one could let 

(29) V(x) ^ V(2;)e*°^ W'^' 

where are the Pauli matrices and j goes from 1 to 3. Thus the exponent includes the dot product 
(or inner product) in M.^. The Pauli matrices do not commute, ^] = ie^^^^. Since i?^ = 
or B^ = ib • (where 6^ is called the isotopic spin vector gauge field) the four- vectors B^ and Bi, 
in (28) do not commute. The purpose of the Pauli spin matrices in the connection B is to rotate 
the particles in isospin space so that they retain their identities at different points in M^. Equation 
(28) can be rewritten so that the curvature is defined as F = dB -|-ie[B,B]. As opposed to the 
Maxwell's equations case, the exterior derivative of the curvature dF, does not equal zero because 
of the commutator in the expression for the curvature. Thus the exterior derivative for the 2-form 
F has to be altered to include the connection B. 

The Lagrangian 

(30) C = iPii^/f^iD^ - m)^ - (}£)Tt{F^,F^-) 

is invariant under the gauge transformation for the covariant derivative given as 
(32) = - ieB^ 

where e is the coupling constant analagous to q in (9). The connection i?^ transforms as 
^^Yang, C. N. and Mills, R. L., (1954). Phys. Rev. 96, p91 

^''The weak and electromagnetic forces are the two manifestations of the electroweak force 
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(33) B^^ B^ + e-^d^a+[a,B^], 

the fiber is the sphere, and the structure group is SU(2). 

Since there are three components of the vector gauge field 6^, there are three vector gauge fields 
representing three gauge particles having spin one. They were later identified as the intermediate 
vector bosons and which mediate the electroweak interactions. The fact that there are 
three gauge particles is dictated by the fact that the gauge field is coupled with the three Pauli 
spin matrices. Also, since the charges of the up quark and down quark differ by one, the gauge 
field particles that are absorbed and emitted by them in quark-quark interactions can have charges 
of ±1 or zero. It's astonishing that Yang and Mills in their 1954 paper predicted the existence of 
the three intermediate vector bosons. 

The gauge particles predicted by the Lagrangian (30) have zero mass since any mass term added 
to (30) would make the Lagrangian non-invariant under a local gauge transformation. So the 
force associated with the particles would have infinite range as the photons of the electromagnetic 
interaction do. Of course the weak force (the force responsible for particle decaying slowly, typically 
their lifetimes are about 10"^'' seconds or much less) and strong nuclear force are short range. This 
discrepency was corrected some years later by the introduction of spontaneous symmetry breaking 
in the electroweak SU{2) x U{1) theory of Weinberg, Salam and Glashow (WSG) using the Higgs 
mechanism. The WSG theory, which explains the electromagnetic and weak forces, predicts the 
existence of four gauge bosons: the three massive ones, and and the photon. Moreover, 
it predicts the mass of the (80.37 ±0.03 GeV) and Z° (92 ± 2 GeV), where GeV represents a 
billion electron volts. The was discoverecQ in 1983 (its mass is now reported at 80.425 Gev 
± 0.033 GeV) and later that year the was discovereq^j (its mass is now reported at a mass of 
91.187 ± 0.002 GeV). 

The Euler-Lagrange equations for equation (30) give the Dirac equation 

(34) 7^(a^ - ieB^)i: + = Q 

and also the vector equation for the vector field F, namely 

(35) ^^'Y^, - ie[B^',¥^,] = -^e^^.a^ = -J, 

which, if it weren't for the commutator, is the same form as the non-homogeneous four-vector 
Maxwell equation. The commutator causes the gauge particles to interact with themselves. 

The effect of these equation is explained by 't Hooft El who with Veltman proved the renormaliz- 
ability of Yang- Mills theories. 

...The B quanta would be expected to be exchanged between any pair of particles 
carrying isospin, generating not only a force much like the electro-magnetic force, but 

^''Arnison, G. et. al., (1983). Phys. Lett. 122B, 103 
^^Arnison, G. et. al., (1983). Phys. Lett. 126B, 398 

't Hooft, Gerardus, editor, (2005) 50 Years of Yang-Mills Theory, World Scientific. 
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also a force that rotates these particles in isospin space, which means that elementary 
reactions envolving the transmutation of particles into their isospin partners will result. 
A novelty in the Yang-Mills theory was that the B quanta are predicted to interact 
directly with one another. These interactions originate from the commutator term in 
the Ffj^i, field [equation (35)], but one can understand physically why such interactions 
have to occur: in contrast with ordinary photons, the Yang-Mills quanta also carry 
isospin, so they will undergo isospin transitions themselves, and furthermore, some of 
them are charged, so the neutral components of the Yang-Mills fields cause Coloumb-like 
interactions between these charged particles. 

So the Yang- Mills equations indicate that for instance for the up quark down quark doublet, the 
W~ generates a force that rotates the d into the u in isospin space exhibited by the transition 
d — > M + W~ . The commutator in equation (35) is responsible for interactions like W ^ W + Z 
occurrinj^. and the W can radiate producing a photon vnW + ^. 

The Yang-Mills equations can be derived from the differential geometry Lagrangian density, where 
A; is a constant 

(36) C = -kTriF A*F). 

The Euler-Lagrange equations produce dsF = (the Bianchi identity) and in the absence of 
currents, ds * F = where ds is the exterior covariant derivative. These are the Yang-Mills 
equations in compact form. 

The Feynman Stamp 

In QED after Schwinger, Tomonaga and Feynman addressed the singularites produced by the 
self-energy of the electron by renormalizing the theory, they were then exceedingly successful in 
predicting phenomena such as the Lamb shift and anomalous magnetic moment of the electron. 

Feynman introducecj^ schematic diagrams, today called Feynman diagrams, to facilitate calcula- 
tions of particle interaction parameters. External particles, represented by lines (edges) connected 
to only one vertex are real, i.e., observable. They are said to be on the mass shell, meaning their 
four-momentum squared equals their actual mass, i.e., = E'^ —p^. Internal particles are rep- 
resented by lines that connect vertices and are therefore intermediate states - that's why they are 
said to mediate the interaction. They are virtual and are considered to be off the mass shell. This 
means their four-momentum squared differs from the value of their actual mass. This is done so 
that four-momentum is conserved at each vertex. The rationale for this difference is the application 
of the uncertainty principle AE ■ At = h. Since At, the time spent between external states is very 
small, for that short time period, AE and thus the difference between the actual and calculated 
mass can be large. In the following Feynman diagrams, the time axis is vertical upwards. 

The diagram on the upper-left of the stamp (Figure 1) is a vertex diagram, and as such rep- 
resents a component of a Feynman diagram. It illustrates the creation of an electron-positron 
pair from a photon, 7; it's called pair production. The 7 is represented by a wavy line. The 
Feynman-Stuckelberg interpretation of negative-energy solutions indicates that here the positron, 

^^This is indicated in Figure 1 of the Yang-Mills paper 
Feynman, R. P., (1949), Phys. Rev. 76, 769 
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the electron's antiparticle, which is propagating forward in time is in all ways equivalent to an 
electron going backwards in time. If all the particles here were external, the process would not 
conserve energy and momentum. To see this you must first remember that since the photon has 
zero mass - due to the gauge invariance of electromagnetic theory- its energy and momentum are 
equal. Thus (3 which equals ^ has the value 1; but /5 = ^ so that the photon's velocity is always 
c, the speed of light. In the electron-positron center of mass frame (more aptly called the center 
of momentum frame, since the net momentum of all the particles is zero there), the electron and 
positron momenta are equal and are in opposite directions. The photon travels at the speed of 
light and therefore its momentum cannot be zero; but there is no particle to cancel its momentum, 
so the interaction cannot occur (for it to occur requires a Coloumb field from a nearby nucleus to 
provide a virtual photon that transfers momentum producing a nuclear recoil). Therefore the 7 in 
the diagram is internal. Its mass is off the mass shell and cannot equal its normal value, i.e., zero. 

The diagram on the lower-left of the stamp (Figure 2) is also a vertex diagram and represents an 
electron-positron pair annihilation producing a 7. Again, if all the particles are external, conserva- 
tion energy and momentum prohibits the reaction from occurring. So the 7 must be virtual. 

The diagram (Figure 3) on the bottom to the right of Feynman was meant to represent an electron- 
electron scattering with a single photon exchange. This is called M0ller scattering. (It can, however, 
represent any number of interactions exchanging a photon.) The diagram represents the t-channel of 
M0ller scattering; there is another diagram not shown here representing the n-channel contribution 
where t and another variable s are called the Mandelstam variables. They are used in general 
to describe 2-body — > 2-body interactions. If you rotate the diagram in Figure 3 by 90° you have 
the s-channel diagram for electron-positron scattering called Bhabha scattering shown in Figure 
4 but not on the stamp. Here an electron and positron annihilate producing a virtual photon 
which in turn produces an electron-positron pair. There is also a t-channel contribution to Bhabha 
scattering. The cross section for Bhabha scattering can be easilly obtained from the one for M0ller 
scattering by interchanging the s and u in the cross section expression in a process called crossing. 
Small angle Bhabha scattering is used to test the luminosity in e'^-e~ colliding beam accelerators. 

To the right of the M0ller scattering diagram is a vertex correction to electron scattering shown in 
Figure 5 where the extra photon forms a loop. It is used to calculate both the anomalous magnetic 
moment of the electron and muon, also the anomalous magnetic moment contribution to the Lamb 
shifll^. The other two contributions to the Lamb shift are the vacuum polarization and the electron 

mass renormalization. The Lamb shift explains the splitting in the spectrum of the 25*1 and 2Pi 

2 2 

levels of hydrogen; whereas Dirac theory alone incorrectly predicted that these two levels should 
be degenerate. 

The low-order solution of the Dirac equation predicts a value of 2 for the g-factor used in the 
expression for the magnetic moment of the electron. The vertex correction shown in Figure 5, 
however, alters the g-factor producing an anomalous magnetic moment contribution written as 
When this and higher order contributions are included, the calculated value of for the 
electron is 1159 652 460(127) (75) x lO'^^ ^nd the experimental value is 1159 652 193(10) x 
10~^^ where the numbers in parenthesis are the errors. This seven-significant figure agreement is a 
spectacular triumph for QED. We need not emphasise that the calculations for all these diagrams 
use the gauge principal for quantum electrodynamics. 

The other diagrams on the stamp are all vertex diagrams and show how Feynman's work originally 
^®See for instance pl56, Griffith, David, (f987) Introduction to Elementary Particles Jolin Wiley and Sons. 
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applicable to QED was then later used to elucidate the electroweak force. This is exemplified 
on the stamp by flavor changing transitions, e.g., d + u shown in Figure 6 and flavor 

conserving transitions, e.g., d — > Z° + d of the electroweak force - the u and d quarks have different 
values of flavor. The process in Figure 6 occurs for instance in /? decay where a neutron (udd) 
decays into a proton (udu) and electron and an anti-neutrino. What happens is that the transition 
d ^ u + W~ corresponds to a rotation in isospin space. This rotation is caused by the virtual W~ 
which mediates the decay. It in turn decays into an electron and an anti-neutrino. The calculations 
for these transitions all use the Yang-Mills equations. Although the quarks are confined in the 
hadrons - particles that undergo strong interactions like the proton and neutron - they are free to 
interact with the intermediate vector bosons. 

Who Designed The Stamp? 

Feynman's daughter Michelle was sent a provisional version of the stamp by the United States 
Postal Service and advised on the design of the stamp by among others, Ralph Leighton, coauthor 
with Richard Feynman of two popular books; and Cal Tech's Steven Frautschi and Kip Thorne. 
Frautschi and Leighton edited the Feynman diagrams, and Frautschi rearranged them and composed 
the final design. The person who chose the original Feynman diagrams that form the basis for the 
stamp remains a mystery. 
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Appendix A: Yang-Mills Derivation 

We begin by performing a phase transformation 
(Al) i;' = Sip 

where S = e*"('')'°' and use the covariant derivative = — ieB^ which transforms in the same 
way as indicated in equation (A2) 

(A2) D'lp' = SDtj;. Then 

(A3) (9^ - ieBl)Si; = {d^S)^ + Sd^i, - ieB'^i^ 
But (A3) equals Sd^i) - ieSB^il). 
Cancelling Sd^ij^ on both sides we get, 
(A4) {d^S)%l) - ieB'^Si) = -ieSB^^I;, or 

(A5) -ieB'^S = -ieSB^ - {d^,S) or B'^S = SB,, + {d^S)/{ie), thus 
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(A6) B'^S = SB^ - i{d^S)/e or 
(A7) B'^ = SB^S-^-i{d^S)S-^/e 
For a infinitesimal, S = 1 + ia ■ a, so 

(A8) B'^ = (l + ia- cf)B^(1 - ia ■ a) - z(l/e)5^(l + ia ■ a){l - ia ■ a) and 

Remembering that {a ■ a){b • a) = a ■ b + ia ■ {a X b), setting = a - b^, and since a is infintessimal, 
we drop terms of order getting 

(A9) b'^ ■ a = b/j, ■ a + i[{a ■ (j)ib^ ■ a), (6^ ■ a){a ■ a)] + (l/e)5^(a • a) and finally 

(AlO) b'^ = bn + 2{b^ X a) + {1/ 6)8^0, which is equation (10) in the Yang-Mills paper. 

Pauli, in equation (22a) of Part I of his 1941 Rev. Mod. Phys. article gives the electromagnetic 
field strength as [D^, D^,] = —ieF^y which apart from the minus sign agrees with our conventions 
and where = d^ — ieA/j,. So by following suit, the field strength for the Yang- Mills strength can 
be obtained from the commutator 

(All) [D^, D,] = (5^ - ieB^){d, - ieB,) - {8, - ieB,){d^ - ieB^) 

operating on the wave function ip. Note that —d^{B^xp) = —{d^B^)ip — B^d^ip. So we get an 
apparently extra —Bud^ and a B^dn term. Thus expanding (All) we get 

(A12) d^d^ - ied^B^ - ieBf,d^ - ieB^d^ - e^B^B^ - d^O^ + ied^B^ + ieB^d^ + ieB^d^ + e'^B^B^ 
which reduces to 

(A13) ze(a^S^ - d^B^) - e^[Bf,, B^] or 

(A14) [Dn,Di,] = leF^v where F^j^i, is given by equation (28). 

If we let B^ = a ■ bfj, we can rewrite the equation F^i, = {diyB^ — d^B^) + ie{B^Bi, — BiyB^) as 
(A15) F^^ = (d^B,, - d^B^) + ie{2ia ■ b^ x b^) 
If we further let F^j^n = f^^ ■ a, we get 
(A16) = {d^b^ - d^b,) - 2eb^ X b. 
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which is equation (9) in the Yang-Mills paper. 
Appendix B, Finding the Field Strength 

We reconstruct how one can go about determining the field strength. Since 

f;, = s-'F^,s (1) 

under an isotopic gaude transformation, let's start off with the electromagnetic-like field strength 
in the primed syst em 

= duB'^ d,Bl (2) 

and express it in terms of the non-primed system fields. We calculate d^B'^ from B'^ = S~^Bfj,S + 
iS~^{dnS) / e, equation (A7), obtaining 

d,B'^ = -s-^{d,s)s-^B^s + s-\d,B^)s + s-^B^d,s + 

i/e[-s-^{d^s)s-^d^s + s-^d^d^s] (3) 

So 

d,B'^ - d^Bl = -S-'[id,S)S-'B^ - id^S)S-'B,]S 
+S-^[d^Bi, - d^B^]S + S-'^[Bf,du - B^d^]S+ 



i/e[-s-\d,s)s-^d^s + s-\d^,s)s-^d,s] (4) 

We see that the +S~^[duB^j_ — d^By\S term satisfies equation (1) if the field strength only had 
the electromagnetic-like contribution. The other terms must either represent the transformed non- 
electromagnetic-like part of Fjuj, or be cancelled by adding the non-electromagnetic terms to equa- 
tion (2). Since S is only used for the transformation, it should not appear in the expression for 

The i/e term in equations (4) dictates that a term multiplied by ie be added to equation (2). 
Since S~^{d^S) and S~^di,S appear in the expressions for B'^ and i?^ respectively, the product 
of S~^{d^S) and S~^di,S that appears in the last term of equation (4) suggests that we should 
start our quest to eliminate extra terms in equation (4) by adding leB'^B^ to that equation. This 
product gives 

ieB'^^Bl = ie[S-^B^S + iS-^{d^S) / e]* 

[S-^B^S + iS-^{d^S)/e] (5) 

which equals 
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ieS-^B^B^S - i/eS-\d^S)S-^d„S- 

S-^B^d^S - S-\d^S)S-^B,S (6) 

All but the first term (which represents the transformation of ieB^By) cancel components of the 
extraneous terms in equation (4). And ie{B'^Bl — B^B'^) cancels all of the extraneous terms except 
the transformation of ie{B^Bi, — ByB^). 

After performing the cancellation, we get 

d^B'^ - d^Bl + ieiB^Bl - BIB^) = 
S-\d,B^ - df,B, + ie{B^B, - B,B^)]S (7) 

which satisfies equation (1). 
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Figure 1. A pair production vertex. 




Figure 2. A pair annihilation vertex. 




Figure 3. Electron-electron (M0ller) scattering. 
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Figure 4. Electron-positron (Bhabha) scattering. 




Figure 5. Radiative Correction. 




Figure 6. A flavor non-conserving transition vertex. 
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